End-to-End Data Solutions for Distributed Petascale Science

نویسندگان

  • Jennifer M. Schopf
  • Ann Chervenak
  • Ian Foster
  • Dan Fraser
  • Dan Gunter
  • Nick LeRoy
  • Brian Tierney
چکیده

Jennifer M. Schopf, Ann Chervenak, Ian Foster, Dan Fraser, Dan Gunter, Nick LeRoy, Brian Tierney 1 Computation Institute, University of Chicago and Argonne National Laboratory 2 Mathematics and Computer Science Division, Argonne National Laboratory 3 Information Sciences Institute, University of Southern California 4 Department of Computer Science, University of Chicago 5 Lawrence Berkeley National Laboratory 6 Department of Computer Science, University of Wisconsin

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Zest: The Maximum Reliable TBytes/sec/$ for Petascale Systems

3 Abstract PSC has developed a prototype distributed file system infrastructure that vastly accelerates aggregated write bandwidth on large compute platforms. Write bandwidth, more than read bandwidth, is the dominant bottleneck in HPC I/O scenarios due to writing checkpoint data, visualization data and post-processing (multi-stage) data. We have prototyped a scalable solution on the Cray XT3 c...

متن کامل

The CEDPS Troubleshooting Architecture and Deployment on the Open Science Grid

Tracking failures and poor performance across a widely distributed system of resources has proven challenging for many ongoing DOE applications. An example is the Open Science Grid (OSG) project, which currently experiences a roughly 15% job failure rate. This can be an issue not only for Grid computing but for anyone performing large-scale data transfers to remote machines because of the large...

متن کامل

Real-time data access monitoring in distributed, multi- petabyte systems

Petascale systems are in existence today and will become common in the next few years. Such systems are inevitably very complex, highly distributed and heterogeneous. Monitoring a petascale system in real-time and understanding its status at any given moment without impacting its performance is a highly intricate task. Common approaches and off-theshelf tools are either unusable, do not scale, ...

متن کامل

Stork data scheduler: mitigating the data bottleneck in e-Science.

In this paper, we present the Stork data scheduler as a solution for mitigating the data bottleneck in e-Science and data-intensive scientific discovery. Stork focuses on planning, scheduling, monitoring and management of data placement tasks and application-level end-to-end optimization of networked inputs/outputs for petascale distributed e-Science applications. Unlike existing approaches, St...

متن کامل

Petascale Research in Earthquake System Science on Blue Waters (PressOn)

Broader Impacts. The Southern California Earthquake Center (SCEC) conducts a broad program of earthquake system science that seeks to develop a predictive understanding of earthquake processes with a practical mission aimed at providing society with improved understanding of seismic hazards. In partnership with earthquake engineers, SCEC researchers are developing the ability to conduct end-to-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007